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PCT/DK98/00104 



.Title: ' An in vitro rtBthod for construction of a DNA library 



FIELD OF THE INVENTION 

The present invention relates to optimizing DNA sequences 
in order to (a) improve the properties of a protein of interest 
by artificial generation of genetic diversity of a gene encoding 
the protein of interest by the use of the so-called gene- or DNA 
shuffling technique to create a large library of '^genes", ex- 
pressing said library of genes in a suitable expression syszerr. 
and screening the expressed proteins in respect of specific chsr- 
acceristics to determine such proteins exhibiting desired proper- 
ties or (b) improve the properties of regulatory elements such as 
promoters or terminators by generation of a library of these ele- 
ments, transforming suitable hosts therewith in operable conjunc- 
tion with a structural gene, e.xpressing said structural gene and 
screening for desirable properties in the regulatory element. 

EAC?:3R0UND OF THE: i::ve::t:::: 

It is generally found :hat a protein performing a certain 
bioactivity exhibits a certain variation between genera and even 
between memlDers of the seme species differences m.ay exist. This 
variation is of course even m.ore outspoken at the qenom.ic level. 

This natural genetic diversity among genes coding for pro- 
teins having basically the same bioactivity has been generated in 
N^ature over billions of years anc reflects a natural optimization 
of the proteins codec for in respect of the environm.ent of the 
orcanism in question. 



wo 98/41653 ^ PCT/DKSS/OOKM 

In today's society the conditions of life are vastly re- 
moved from the natural environment and it has been found that the 
naturally occurring bioactive molecules are not optimized for the 
various uses to which they are put by mankind, especially when 
they are used for industrial purposes. 

It has therefore bee:, of interest to industry to identify 
such bioactive proteins that exhibit optimal properties in re- 
spect of the use to which it is intended. 

This has for many years been done by screening of natural 
sources, or by use of mutagenesis. For instance, within the tech- 
nical field of enzym.es for use in e.g. detergents, the washi.-.g 
and/or dishwashing per formance . of e.g. naturally occurring pro- 
teases, lipases, amylases and ceilulases have been improved sig- 
nificantly, by in vitro modifications of the enzymes. 

In m.ost cases these irr.prove.-nents have been obtained by 
site-directed mutagenesis resulting in substitution, deletion cr 
insertion of specific amino acid residues which have been chosen 
eitr.er on the basis of their type or on the basis of their loca- 
tion in the secondary or tertiary structure of the mature enzvT- 
(see for instance US patent no. 4/518,584). 

In this -anner tne preparation of novel polypeptide vari- 
ants anc "utants, s-cn as novel -odified enzyn^es with altered 
characteristics, e.c. specific activity, substrate specificity, 
ther-al, oH anc salt stability, pH-optir^ux, pi, K.^, V^^^, etc., 
has successfully been perforr:;ed to obtain polypeptides with L^- 
proved properties. 

For instance, within the technical field of enzyn^es the 
washing and/or dishwashmc performance of e.g. proteases, li 
oases, a-ylases and ceilulases have been improved significantly. 
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An alternative general approach for modifying proteins and 
enzymes has been based on random mutagenesis, for instance, as 
disclosed in US 4,894,331 and WO 93/01285 

As it is a cumbersome and time consuming process to obtain 
5 polypeptide variants or mutants with improved functional prop- 
erties a few alternative methods for rapid preparation of modi- 
fied polypeptides have been suggested. 

Weber et al., (1983), Nucleic Acids Research, vol. 11, 
5661, describes a method for modifying genes by in vivo recom- 
10 bination between two homologous genes. A linear DNA sequence com- 
prising a plasmid vector flanked by a DNA sequence encoding al- 
pha-1 human interferon in the 5 ' -end and a DNA sequence encoding 
alpha-2 human interferon in the 3'-end is constructed and trans- 
fected into a rec A positive strain of E. coll. Recombinants were 
15 identified and isolated using a resistance marker. 

Pompon et ai., (19S9), Gene 83, p. 15-24, describes a 
method for shuffling gene co-f-ins of mamjnalian cytochrome P-450 
by in vivo recombination o: partially homologous sequences in 
Saccha ror.yces ceravisiae by transforming Saccharomyces cerevisiae 
20 with a linearized plas~ic with filled-in ends, and a DNA fragm.eni: 
being psrtiallv horp.olocous to the ends of said plasmid. 

In WO 97/07205 a rr.ethod is described whereby polypeptide 
variants are prepared bv shuffling different nucleotide sequences 
o: homologous D:-:.-. sequences by in vivo recorrjoinat ion using plas- 
25 micic DN.n as te.^^.plate. 

U5 patent no. 5,053/257 (Assignee: Genencor Int. Inc.) dis- 
closes a method for producing hybrid polypeptides by in vivo re- 
combination. Hybrid DNA sez-^er.zes are produced by forming a cir- 
cular vector comprising a r-:jr:. icat ion sequence, a first DNA se- 
30 cuence encoding the amino-::-ir::Mr.al portion of the hybrid poly- 
peptide, a second DNA sequence encoding the carboxy-terminal por- 
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tion of said hybrid polypeptide. The circular vector is trans- 
formed into a rec positive microorganism in which the circular 
vector is amplified. This results in recombination of said circu- 
lar vector mediated by the aaturally occurring reccnbinaticn 
mechanism of the rec positive microorganism, which include prc- 
karyotes such as Bacillus and £. coli, and eukaryotes such as 
Saccharomyces cerevisiae. 

One method for the shuffling of homologous DNA sequences 
has been described by Stemmer (Stemmer, (1994), Proc. Natl. Acaz. 
Sci. USA, Vol. 91, 10747-10751 ; Stemmer, (1994), Nature, vol. 
370, 389- 391) . The method concerns shuffling homologous DNA se- 
quences by using in vitro PGR techniques. Positive recombinan" 
genes containing shuffled DMA sequences are selected from a 
library based oh the improved . function of the expressed proteins. 

The above method is also described in WO 95/22625. WO 
95/22625 relates to a method for shuffling of homologous DNA se- 
quences. An important step in tne method described in WO 95/22625 
is to cleave the homologous ter.plate double-stranded polynuclec- 
tide into rando.T. fragments of a desired size followed by hom.olc- 
cousiy reassembling of the fragments into full-length genes. 

A disadvantage inherent to the method of v;o 95/22625 is, 
hov;ever, that the diversicv generated through that method is lim- 
ited cue to the use of homologous gene sequences {as defined in 
v;0 95/22625) . 

Another disadvantage in the method of WO 95/22625 lies in 
the production of the random fragments by the cleavage of the 
template double-stranded polynucleotide. 

A further reference of interest is WO 95/17413 describing a 
method of gene or DNA shuffling by recombination of specific CNA 
sequences - so-called design elements (DEL) - either by recombina- 
tion of synthesized double-stranded fragments or recombination cf 
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PGR generated sequences to produce so-called functional elements 
(FE) comprising at least two of the design elements. According 
to the method described in WO 95/17413 the recombination has to 
be performed among design elements that have DNA sequences with 
sufficient sequence homology to enable hybridization of the dif- 
ferent sequences to be recorabined. 

WO 95/17413 therefore also entails the disadvantage that 
the diversity generated is relatively limited. Furthermore the 
method described is time consaming, e.xpensive, . and not suited for 
automat isat ion . 

Despite the existence of the above methods there is still a 
need for better iterative in vitro recombination methods for pre- 
paring novel polypepcide variants. Such methods should also be 
capable of being performed in srr.all volumes, and amenable co 
automatisation . 

suMV-^.RY or the: invention 

The present invention concerns briefly the utilization of 
ter.plate shift of a newly synthesized DNA strand during in vitro 
DNA synthesis m order to achieve DNA shuffling. By using this 
tecnnique it is possible to obtain such results m a more expedi- 
ent manner, anc to 5C~e extent even a greater variation than in 
the above mentioned rr.ethocs. 

The m.ethoc of the invention is also very well suited for 
adaption to automatisation. 

In a preferred e~r)odi-ent the technique is used in combina- 
tion with an error-prone poli.-erase thereby introducing an even 
greater variation in the library created. 

More specifically the present invention relates to a method 
for- the construction c: a library of recoPbined hom.oicgous 
polynucleotides from a nur^er of different starting single or 
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double stranded parental DNA templates and primers by induced 
template shifts during an in vitro polynucleotide synthesis usin; 
a polymerase, whereby 

A. extended primers or pc.ynucleotides are synthesized by 

a) denaturing parental double stranded DNA templates -= 
produce single stranded templates, 

b) annealing said primers to the single stranded DN'A ter- 

plates, 

c) extending said primers by initiating synthesis by use =: 
said polymerase, 

d) cause arrest of the synthesis, and 

e) denaturing the double, strand to separate the extended 
primers from the templates, 

B. a terr.plate' shift is induced by 

a) isolating the newly synthesized single stranded extend- 
n-^T^-s fro- the te-?lstes and repeating steps A.b) to 
A.e) using said extended primers produced in (A) as both 
orimers and te-piates, or 

b) repeating steps A-'n) to A.e), 

C. the above process is 



r-inated after an appropriate nun-b^r 



of cycles of process steps A. and B.a), A. and B.b), or ccr.- 
binations thereof, anc 
-0 ootionaliv the produced polynucleotides are amplified in = 
standard ?CR reaction v,-ith specific primers to selective! 
a~.?lify hc-ologous polynucleotides of interest. 

In soecific erriodir-nts various modifications can be m.ad 
th^'orocess of the invention. For exa^.pie it is advantagetu 
to aoolv a defective polyr.erase either an error-prone oolymeras 
to -introduce mutations m comparison to the templates, or a ?c 
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lymerase that will discontinue the polynucleotide synthesis pre- 
maturely to effect the arrest of the reaction. 

Further modifications will be described below. 
In a further aspect the invention relates to a method of 
5 identifying polypeptides exhibiting improved properties -in com- 
parison to naturally occurring polypeptides of the same bioactiv- 
ity, whereby a library of recombined homologous polynucleotides 
produced by the above process are cloned into an appropriate vec- 
tor, said vector is then transformed into a suitable host system, 
10 to be expressed into the corresponding polypeptides and dis- 
played, said polypeptides are then screened in a suitable assay, 
and positive results selected. . 

In a still further aspect the invention relates to a method 
for producing a' polypeptide of interest as identified in the pre- 
15 ceding process, whereby a vector comprising a polynucleotide en- 
coding said polypeptide is transformed into a suitable host, said 
host is grown to express said polypeptide, and the polypeptide 
recovered and purified. 

Finally, in further final aspects the invention 
20 relates to a recorrbinec/ shuf f led protein, which is obtainable by 
any of the methods according to the invention, and which is a 
reco-bined/shuffled protein comprising the sequences disclosed 

herein (vide infra). 

In those final aspects of the invention, the terrr^. 
25 "obtainable" denotes that said protein is preferable obtained by 
a method according to the invention. However a prior art known 
recombination/shuffling technique such as those described in v;0 
95/22625 or v;o 95/17413 may be used too, either alone or in 
combination with a method according to the invention, in order to 
30 obtain said recombined protein. 

Accordingly, further final aspect of the invention are; 
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a recombined/shuffied protease obtainable by any of the 
methods according to the invention, and comprising a recombined 
sequence, which at least contain two different partial sequences 
from at least two different wild-type proteases; 
5 a recombined/shuffied lipase obtainable by any of the 

methods according to the invention, and comprising a recombined 
sequence, which at least contain two different partial wild-type 
sequences 

a recombined/shuffied Pseudomonas lipases obtainable by any 
10 of the methods according to the invention, and comprising a 
recc-bined sequence, which at least contain two different partial 
sequences from at least tw.o of the different wild-type 
Pseudo-onas lipases; 

a reco:?bined/shuf fled y.yianase obtainable by any of the 
15 methods according to the invention, and comprising a recombined 
sequence, which at least contcir. two different partial sequences 
from at least two different wilc-type xylanases; 

a recorrbined/sh'jffled ceiiulase obtainable by any of the 
-.othcds according to the invention, and comprising a recorrjoined 
:: so-v.once, which at least contain two different partial sequences 
iirm at least two differen: wild-type cellulases; 

=. recorrbmed/shufilez amylase obtainable by any of the 
methods according to the invention, and comprising a recombined 
sequence, which at least contain two different partial sequences 
25 from at least two different wild-type amylases 

a recorrbined/shuffled laccase obtainable by any of the 
methods according to the invention, and comprising a recombined 
soquonco, which at least contain two different partial sequences 
from at least two different wild-type laccases; 
30 . a recombined/sh'ufflec rhytase obtainable by any of the 

m.ethod.-. according to the invention, and comprising a recorrbined 
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sequence, which at least contain two different partial sequences 
from at least two different wild-type phytases. 



5 DEFINITIONS 

Prior to discussing this invention in further detail, the 
following terms will first be defined. 

"Shuffling": The term "shuffling" means recombination of 
nucleotide sequence fragment (s) between two or more homologous 
10 polynucleotides resulting in output polynucleotides (i.e. polynu- 
cleotides having been subjected to a shuffling cycle) having a 
number of nucleotide fragments exchanged, in comparison to the 
input polynucleotides (i.e. starting point hom.ologous polynucleo- 
tides) . 

15 "Homology of DN.^. sequences or polynucleotides" In the pres- 

ent context the degree of D:;.-. sequence homology is determined as 
the degree of identity between tvo sequences indicating a deriva- 
tion of the first sequence from the second. The homology may 
suitably be ceterminec by means of computer programs known in the 

20 art, such as G.h? provided in the GCG program package (Program 
var.uai for the Wisconsin Package, Version 8, .August 1994, Genet- 
ics Computer Group, 57 5 Science Drive, .\adison, Wisconsin, US.n 
53711 ) (Meedleman, S.5. and Wunsch, C.D., (1970), Journal of Mo- 
lecular Biology, 43, 443-453) . 

25 "Homologous": The term "homologous" means that one single- 

stranded nucleic acid sequence may hybridize to a complementary 
single-stranded nucleic acid sequence. The degree of hybridiza- 
tion may depend on a nu.mher of factors including the amount of 
identity between the sequences and the hybridization conditions 

30 such as temperature and salt concentration as discussed later 
( vice infra) . 
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Using the computer program GAP (vide supra) with the fol- 
lowing settings for DNA sequence comparison: GAP creation penal-y 
of 5.0 and GAP extension penalty of 0.3, it is in the preser.t 
context believed that two ONA sequences will be able to hybridi-e 
(using low stringency hybridization conditions as defined belov) 
if they mutually exhibit ^ degree of identity preferably of =c 
±east 70%, more preferably at least 80%, and even more preferably 
at least 85%. 

"heterologous": If two or more DNA sequences mutually ex- 
hibit a degree of identity which is less than above specified, 
they are in the present context said to be "heterologous". 

"Hybridization:" Suitable experimental conditions icr 
determining if two' or more DN.-. sequences of interest do hybridize 
or not is herein defined as hybridization at low stringency as 
described in detail belov;, 

Molecules to which the oligonucleotide probe hybridizes 
under these conditions are detected using a x-ray film or a phos- 
phoir.ager . 

"primer": The ter- -prir.er" used herein especially in ccr.- 
p.ection v;ith a ?CR reactior. is an oligonucleotide (especially a 
"?C?.-orirr.er") defmec ar.z constructed according to general star.- 
darc specifications --.ncvn m the art ("PCR A practical approach" 

IRL Press, (1S91) ) • 

"A primer directec to a sequence:" The term "a prim.er di- 
rected to a sequence" means that the primer (preferably to be 
used m a ?C?. reaction) is constructed to e:-:hibit at least 60% 
decree of sequence identity to the sequence fragment of interest, 
m.ore preferably at least 90^ decree of sequence identity to the 
sequence fragment of interest, v;hich said primer consequently is 
"directed to". The primer is designed to specifically anneal a^ 
the sequence fragm.ent or region it is directed towards at a give 
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temperature. Especially identity at the 3' end of the primer is 
essential . 

"Flanking" The term "flanking" used herein in connection 
with DNA sequences comprised in a PCR-f ragment means the outer- 
most partial sequences of the PCR-f ragment, both in the 5' and 3' 
ends of the PCR fragment. 

"Polypeptide" Polymers of amino acids sometimes referred 
to as proteins. The sequence of amino acids determines the 
folded conformation that the polypeptide assumes, and this in 
turn determines biological properties and activity. So.Te 
polypeptides consist of a single polypeptide chain (monomeric) , 
whereas other comprise several associated polypeptides 
(multimeric) . All 'enzyir-s and antibodies are polypeptides. 

"Enzyme" A protein capable of catalysing chemical reac- 
tions. Specific types of enzyrr.es to be mentioned are such as 
a-ylases, proteases, carbohydrases , lipases, cellulases, oxi- 
doreductases, esterases, etc. Of specific interest in relation 
to the present invention are enzyr.es used in detergents, such as 
proteases, lipases, cellulases, amylases, etc. 



DETAILED DESCRIPTION OF THE INVi^NTION 

The oresent invention relates in its first aspect to a 
method for the construction of a library of recombined homologous 
polynucleotides from a number of different starting single or 
double stranded parental DNA ter.olates and primers by induced 
template shifts duri.ng an in vitro polynucleotide synthesis using 
a polyrr.erase, whereby 

A. extended primers or polynucleotides are synthesized by 

a) denaturing a parental double stranded DMA template to 
oroduce single stranded templates. 
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b) annealing said primers to the single stranded DNA tem- 
plates, 

c) extending said primers by initiating synthesis by use c: 
said polymerase, 

5 d) cause arrest of the synthesis, and 

e) denaturing the double strand to separate the extenc^z 
primers from the templates, 

B. a template shift is induced by 

a) isolating the newly synthesized single stranded extended 
10 primers from the templates and repeating steps A.b) zz- 

A.e) using said extended primers produced in (A) as bc"- 
primers and templates, or 

b) repeating steps A.b) to A.e), 

C. the above process is terminated after an appropriate number 
15 of cycles of process steps A. and 3. a), A. and 3.b), or ccr- 

binations thereof, and 

D. optionally the produced polynucleotides are amplified in a 
standard PGR reaction with specific primers to selectively 
a-plify homologous polynucleotides of interest. 

20 

Accordinc to the invention the polymerase may be a DNA or £ 
RN'A ool^/T.erase, specific polym.erases to be mentioned are such as 
DNA polymierases like 74 polymerase, T7 polym.erase , E. coll DSA 
pol\-r;erase I or the Klenow fragment of DMA polyTr^erase I, or ther- 
25 mostable polymerases such as Tac, .Amplitaq, Vent, Pwo. 

One of the advantages of the invention is that it makes it 
possible to control the length of the extension of the primers in 
the reaction in a convenient manner. 

This can be accomplished by various means such as choice :: 
30 polymerase, the physical and chemical conditions during the az- 
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tion of the polymerase, e.g. pH, temperature, buffer, salt con- 
centration, and addition of various chemicals. 

It is known that various polymerases carry out the DNA syn- 
thesis at different rates (nucleotides incorporated pr. second) . 
For example has the Klenow fragment of DNA polymerase I a limited 
extension rate compared to e.g. the Taq polymerase (Sambrook et 
al. 1989) . 

Polymerases also display differences in processivity, which 
is the average number of nucleotides incorporated before the po- 
lyn-.erase dissociates from the template/extended primer; again the 
Klenow polymerase is an example of a polymerase with limited 
processivity. 

The choice of polymerase is therefore an important m.eans in 
controlling the average extension of the primers. 

These conditions may also exert an influence on the fidel- 
ity of the polym.erase (the rate by which point mutations are in- 
troduced; HIV reverse transcriptase is an example of a polymerase 
of lov; fidelity), a parameter useful in combining shuffling and 
rr.utagenesis . 

20 In specific erbodi-er.ts various modifications can be made 

ir. the process of the invention. For example it is advantageous 
to apply a defective poly~erase either an error-prone polym.erase 
to introduce mutations in com.parison to the templates, or a po- 
Iv-erase that will discontinue the polynucleotide synthesis pre- 
-aturely to effect the arrest of the reaction. Such a defective 
poiy-erases that could be mentioned is a Klenow polymerase having 
low processivity. 

In another errbociment c: the invention polym.erase will be 
added after each cycle, if tne poly-erase used is not thermo- 
30 stable. 



15 



25 



wo 98/41653 14 PCT/DK98/00104 

According to the invention the starting single" or double 
stranded parental templates may be different in that they contain 
different point mutations in the same native polynucleotide 
(gene), or they can be homologous polynucleotides (genes) iss- 
5 lated from nature, which may be amplified by PGR, or they can be 
combinations thereof. The templates used in the process of the 
invention are hereby homologous showing an identity at the D)l-. 
level of e.g. more than 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60*, 
55%, or even more than 50% identity. 

10 It may be advantageous to use pre-selected templates ccz- 

prising mutations with improved properties of interest. The pres- 
ent recombination m.ethod of the invention will then recombir.e 
said improved mutations for subsequent screening for even further 
improvements in the properties of interest. 

15 " said pre-selected templates with improved properties of ir.- 
terest may have been identified oy standard procedures in the art 
comorising e.g. i) error-prone ?C?, of templates of interest fol- 
lowed by ii) screeninc/selectior. for templates with improved 

.^o^ic-i- ;p--'-s- "--2 mutagenesis frequency (low or 
cr.aracterisv-i^ j-i-^ ^ 

20 man mutagenesis frequency) of zr.e error-prone ?CR step is pref- 
erably adjusted in relation to the subsequent screening capacity, 

i.e. ir tne screen^..- capd^j-^y _b 

^^r^^.'^i.. f ^ cn^ to t:v;o nutations in each 

te-piate) (see WO 92/15545 -for further details). 

25 " The arrest of the polv-rr.erase reaction in step A.d) nay as 

indicated above be obtained in different ways, such as by raising 

. r^v- p^H^n- s-----'^"c -eaaents as described in v;0 
the cerp-oerature / or a^cm-^ h^^- ^ --a^ 

95/17413. 

When raising the temperature for this purpose, it is pre- 
30 ferred to use temperatures between rO^C anc 99V. 



wo 98/4 1 653 ^ ^ PCT/DK98/00 1 04 

When using chemical agents DMSO is a possibility. Appropri- 
ate procedures are mentioned in e.g. WO 95/17413. 

The process of the invention uses annealing of the primers 
to the templates in step l.b. In this context the annealing may 
5 be random or specific, meaning either anywhere on the polynucleo- 
tide or at a specific position depending on the nature of the 
primer. 

Also, the primers to be used may be completely random prim- 
ers (NNNNNNNNNNN) (N meaning a mixture of the four bases (A, T, 
10 G, C) is used at a particular position in the primer during syn- 
thesis), semi-random primers, or specific primers. 

If the extended primers produced are to be separated from 
the primers during the process it is convenient to use labeled 
templates in order to provide a simple means for separation, a 
15 preferred marker is biotin or digoxigenin . 

According to the invention the number of cycles necessary 
v;ill be less than 500, in most cases less than 200, and normally 
less than 100 cycles. 

In an errJDOciment of the invention the above in vitro shuf- 
20 flinc is corjDined wich a subsequent in vivo shuffling by methods 
such as those described in v:0 97/07205. 

In its second aspect the invention relates to a metthod of 
identifying polypeptides exhibiting improved properties in com- 
parison to naturally occurring polypeptides of the sam.e bioactiv- 
25 itv, v/hereby a library of recorrbined homologous polynucleotides 
produced by the above process are cloned into an appropriate vec- 
tor, said vector is then tran5fonr;ed into a suitable host system, 
to be expressed into the corresponding polypeptides, said 
polypeptides are then screened in a suitable assay, and positive 
30 polyoeptides selected. 
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In an embodiment it is contemplated that the polypeptides 
of interest encoded by the shuffled library are expressed as a 
suitable fusion protein (e.g. as a hybrid with glll of bac- 
teriphage M13/fd) in order to display said recombined polypeptide 
5 on the surface of phage or bacteria. 

In a third aspect the invention relates to a m.ethod fcr 
producing a polypeptide of interest as identified in the preced- 
ing process, whereby a vector comprising a polynucleotide encod- 
ing said polypeptide is transformed into a suitable host, said 
10 host is grown to express said polypeptide, and the polypeptide 
recovered and purified. 

The use of partial random (semi-random) or completely ran- 
dom prirr.ers (mixtures of bases in a selected numJoer or all posi- 
tions in the primer) as initiation point for the DNA synthesis 
15 provide certain novel possibilities for the combined use of shuf- 
fling and random mutagenesis. 

it is often associated v;irh difficulties to obtain an i.i 
vitro recombination of polyn-c ieot ides that display relatively 
limited homology. 3y the use of an embodiment of the inventic- 
20 even verv diverse pol vnucieotices can be forced to recorrbine. 

According to that erbociment at least two templates (or tvo 
pools of diverse ter.olates) are applied. The novel synthesis o: 
the one pol vnucleot ide can then be based on only one strand (i.e. 
either the sense or the anti-sense strand) , and the synthesis c: 
25 the other polynucleotide is based the opposite strand. 

This can be accomplished by isolating the complementary 
strands from the two templates, e.g. by having these strands la- 
beled bv biotin. Synthesis of DNr. is initiated by annealing ei- 
ther specific, partly random or co";pietely random primers zo 
30 these templates and adding a suitable pol^/m.erase . This can re 
perform.ed as either separate reactions for the different te-- 
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plates or in just one reaction. Synthesis should preferably be 
performed under conditions that favor production of relatively 
short new fragments. These fragments can subsequently be isolated 
from the templates based on the affinity label. A PGR reaction is 
carried out on these fragments and as the starting material 
originates from two different strands, the newly synthesized 
fragments must recombine in order to produce full length FCR 
products - a kind of forced recombination. 

Also for this embodiment rapid PGR with short or no exten- 
sion time can be applied advantageously in order to enhance re- 
combination, especially if pools of templates are used for the 
two strands. 

The length 'of the primer and the annealing temperature 
utilized in the' process determines if random primers will anneal 
and the numJoer of mismatches between the template and the primer 
that can be accorrmodated . By varying the primer length and the 
annealing temperature the method of the invention provides a 
means for achieving random n'Jtacenesis within a certain nucleo- 
tide windov; representing the length of the primer. The method of 
the invention thereby provides substantial benefits compared to 
other random mutagenesis approaches; especially the high prob- 
ability for several base substitutions close to each other in the 
primary sequence, e.g. the use of a completely random 20'mer 
(m.ixture of all four nucleotides in all 20 positions) will ac- 
cording to theory under given experimental conditions give a cer- 
tain reasonably high probability for having several base substi- 
tutions close to each other. 

Error prone PGR {=hich m.utagenesis frequency PGR) does nor 
provide this possibility. Error prone PGR provides a very low 
probability for having m^ore tihan one base substitution within one 
codon (coding for one amino acid m a translated polypeptide) . 
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Obviously the substitution of only one base within a codor. 
doesn't provide total random mutagenesis (at protein level) as 
only a limited set of amino acid substitutions can be obtained by 
one base substitution at DNA level (e.g. Methionine encoded by 
5 ATG-codon requires three base substitution to become the TGT or 
TGC-codon encoding Cysteine) . 

In one embodiment of the invention the process is therefore 
performed by using random or semi-random primers having a length 
of from 6 to 200 bp, preferably from 10 to 70 bp, and better fro.- 
10 15 to 40 bp. 

One of the advantages in the method of the invention is the 
robustness. In some e-ibodiments the constant presence of full 
length template provides a furcher advantage avoiding PCR con- 
tam.ination problems. Furthermore iz is much less laborius, with 
15 less hands on, than other described methods, thereby providing 
excellent possibilities for au::c"ation. 



PCR-primers : 

The PCR primers are cor;s::ruct:ed according to the standard 
20 descriptions in che arc. Nor-aliy they are 10-75 base-pairs (bp) 
long. However; for the specific errjDodim.ent using random cr semi- 
random primers the ler.ctr; -ay be substantially longer as indi- 
cated above. 



2 5 PCR-reactions : 

If not otherwise menticnec the ?CR-reaction perforrried ac- 
cording to the invention are performed according to standard pro- 
tocols known in the art. 

The term ''Isolation of : Zr. fragment" is intended to cover 
30 as broad as simply an aliquc. ccr.taining the PCR fragment. How- 
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ever preferably the PGR fragment is isolated to an extend which 
remove surplus of primers, nucleotides templates etc. . 

In an embodiment of the invention the DNA fragment (s) 
is (are) prepared under conditions resulting in a low, medium or 
5 high random mutagenesis frequency. 

To obtain low mutagenesis frequency the DNA sequence (s) 
(comprising the DNA fragment (s)) may be prepared by a standard 
PGR amplification method (US 4,683,202 or Saiki et al., (1988), 
Science 239, 487 - 491) . 

10 A medium or high mutagenesis frequency may be obtained by 

performing the PGR amplification under conditions which increase 
the misincorporation of nucleotides, for instance as described by 
Deshler, (1992), GATA 9(4), 103-106; Leung et al., (1989), Tech- 
nique, Vol. 1, No. 1, 11-15. 

1,5 It is also contemplated according to the invention to com- 

bine the PGR amplification (i.e. according to this embodiment 
also DNA fragment m.utation) with a mutagenesis step using a suit- 
able physical cr chemical mutacenizing agent, e.g., one which in- 
duces transitions, transversions , inversions, scrarriDling , dele- 

20 tions, and/or insertions. 

ELxoressinc the rec orbinant protein from the recorrJoinant shuffled 
sequences 

Exoression of the recorrvbinant protein encoded by the shuf- 
25 fled seauence in accordance with the second and third aspect of 
the oresent invention nay be performed by use of standard expres- 
sion vectors and corresponding expression system.s known in the 
art . 

30 Screening and selection 
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In the context of the present invention the term "positiTe 
polypeptide variants" means resulting polypeptide variants pos- 
sessing functional properties which has been improved in compari- 
son to the polypeptides producible from the corresponding input 
DNA sequences. Examples, of such improved properties can be a= 
different as e.g. enhance or iowered biological activity, i."- 
creased wash performance, thermostability, oxidation stability, 
substrate specificity, antibiotic resistance etc. 

Consequently, the screening method to be used for identify- 
ing positive variants depend on which property of the polypeptide 
in question it is desired to change, and in what direction tr.e 

change is desired. 

A number of suitable screening or selection systems 
screen or select for a desired biological activity are described 
in the art. Examples are: 

Strauberg et al. ( Hiotechnology 13: 66S-673 (1995) de- 
scribes a screening syste- for siibtiiisin variants having Cal- 
ciu-- independent stability; 

Bryan ec al. (Proteins 1:326-33h (1935)) describes i 
screening assay for proteases having a enhanced thermal stabil- 
ity; and 

PCT-DK9S/00322 describes a screening assay for lipases har- 
Lnz improved wash per f orrr.ar.ce in washing detergents. 

An e-JDodi-ent of the invention comprises screening or se- 
lection of reccmJDinant protein(s), wherein the desired biologica" 
activity is performance in cish-v/ash or laundry detergents. E:-:a.-. 
pies of suitable dish-wash or laundry detergents are disclosed i 
PCT-DK95/00322 and ViO 95/30011. 

If, for instance, the polypeptide in question is an enz>- 
and .the desired improved functional property is the wash perforr 
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ance, the screening may conveniently be performed by use of a 
filter assay based on the following principle: 

The recombination host cell is incubated on a suitable me- 
dium and under suitable conditions for the enzyme to be secreted, 
the medium being provided v/ith a double filter comprising a first 
protein-binding filter and on top of that a second filter exhib- 
iting a low protein binding capability. The recombination host 
cell is located on the second filter. Subsequent to the incuba- 
tion, the first filter ccr.prising the enzyme secreted from the 
recorrbination host cell is separated from the second filter com- 
prising said cells. The first filter is subjected to screening 
for the desired enzym>atic activity and the corresponding micro- 
bial colonies present on the second filter are identified. 

The filter used for binding the enzymatic activity may be 
any protein binding filter e.g. nylon or nitrocellulose. The too- 
filter carrying the colonies o: the expression organisri may be 
any filter that has no or low affinity for binding proteins e.g. 
cellulose acetate or Durapore-^. The filter may be pre-treated 
with any of the conciticns to be used for screening or may be 
treated d urine the detection of enz^rr.atic activity. 

The enz^7?.atic activitv r.av be detected by a dye, fluores- 
cence, precipitation, pH indicator, IR-absorbance or any other 
k n o v; n t e c h n i c u e for detection of e n z VTTia tic activity. 

The detecting ccrr.oound mav be irr^mobi 1 i zed by any im^Tiobiliz- 
ing agent e.g. agarose, agar, gelatin, polyacrylamide , starch, 
filter paper, cloth; or any corrJoinat ion of immobilizing agents. 

If the improved functional property of the polypeptide is 
not sufficiently good after one cycle of shuffling, the polypep- 
tide may be subjected to another cycle. 

- In an embodiment of the invention wherein homologous 
Dolvnucleotides representinc a nu^joer of mutations of the sarr.e 
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gene is used as templates at least one shuffling cycle is a back- 
crossing cycle with the initially used DNA fragment, which may be 
the wild-type DNA fragir.ent. This eliminates non-essential muta- 
tions. Non-essential mutations may also be eliminated by using 
5 wild-type DNA fragments as the initially used input DNA material. 

Also contemplated to b3 within the invention is polype?- 
ti-^as having biological activity such as insulin, ACTH, glucagon, 
somatostatin, somatotropin, thymosin, parathyroid hormone, pitu- 
ary hormones, somatomedin, erythropoietin, luteinizing hormone, 

10 chorionic gonadotropin, hypothalamic releasing factors, antidiu- 
retic hormones, thyroid stimulating hormone, relaxin, interferon, 
thrombopoeitin (TPO) and prolactin. 

A requirement to the starting parental DNA sequences, en- 
coding the polypeptide (s) , to be shuffled, is that they are az 

15 least 50%, 60%, 70%, SOI, 90>., or 95% homologous. DNA sequences 
being less homologous will nave less inclination to interact and 
recorribine . 

It is also conte-plated according to the invention to shuf- 
fle parental polynucleotides that are homologous as indicated 
20 above originating fro- wilc type organisms of different genera. 

further, the startir.c parental templates to be shuffled may 
oreferably have a lenctr. of fro- about 50 bp to 20 kb, preferably 
about 100 bo to 10 Kb, r.ore oreferred about 200 bp to / kb, espe- 
cially about 4 00 bp to 2 kb. 
25 The starting oarental DNA sequences may be any DNA se- 

quences including wild- type DNA sequences, DNA sequences encoding 
variants or mutants, or modifications thereof, such as extended 
or elongated DNA sequences, and may also be the outcome of DNA 
sequences having been subjected to one or more cycles of shuffi- 
30 inc (i.e. output DNA sequences) according to the method of the 
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invention or any other method (e.g. any of the methods described 
in the prior art section) . 

When using the method of the invention the resulting recon- 
bined homologous polynucleotides (i.e. shuffled DNA sequences), 
5 have had a number of nucleotide fragments exchanged. This results 
in replacement of at least one amino acid within the polypeptide 
variant, if comparing it with the parent polypeptide. It is to be 
understood that also silent exchanges are contemplated (i.e. nu- 
cleotide exchange which does not result in changes in the amino 
10 acid sequence) . 

MATERIALS AND METHODS 

Specific Method Used in the Examples: 

A) DNA encoding different enzyr^e variants of the same gene or 
15 different enzymes having the s^-e type of activity encoded by 

homologous genes are mixed. Tr.e DNA is provided as either PCR 
fragments, plasmid, phage or cenorr.ic DNA. 

B) The resulting pool of DN'A is mixed with DNA Polymerase, 
dNT?, a suitable buffer and primers (being either random oli- 

20 comers (length of o-3C nucleosides ) or specific oligomers 
(lenoth of 6-50 nucleotides) or a combination of both types). 

C) The PCR mixture is out into a PCR thermocycler (either 
cold or hot) in a suitable tube. 

cl) The thermocycler is heated to a temperature of SC- 
25 lOO^C for a period of time ( typically 1-10 min) in order to dena- 
ture the DNA templates . 

c2) Thereafter the follovinc procedure (cycle) is fol- 
lov;ed (repeated) : The template is denatured (typically 90-100°C 
for 0-5 minutes) . Then the terp-^rature is lowered (typically to 
30 a value between 10°C and 90^C f^r G-5 minutes) to allow annealing 
of the primer to the single stranded template. Nov/ the tempera- 
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ture is raised again to denaturation temperature (90-100°C) al- 
lowing small extension of the primer to be synthesized by the 
DNA polymerase during ramping. Alternatively a short extensicr: 
period (typically 0-30 seconds at 70-75°C) can be introduced zz 
5 allow larger extensions of the primers to be generated. When the 
temperature reaches a value where denaturation takas place, the 
extended primers and templates are again separated. This proce- 
dure can be repeated (typically between 1 to 99 cycles). 
D) Having performed the desired number of cycles the gener- 

10 ated small DNA polym.ers can be purified from the oligomers usei 
as primers. One way is to isolate and clone a specific amplified 
band containing the gene codinc^ for the polypeptide of interes: 
into a suitable vector. This can be done either on agarose gel 
(typically isolating fragments between 50 to 1000 base pairs), 

15 by beads (using an affinity label on either templates or pri"- 
ers) or through columns. 

Z) Then the purified (or uhe .".ot purified) DMA polym.ers car. 
be asse-Joled in a standard rCR reaction (for instance 94°C, : 
minutes, (94°C, 30 sec; 55=C, 50 sec; 72°C, 2 min) * 25, 72°C 5 

20 minutes, ^"C) . 

Scecific orimers cr DNA colv^ers generated by specifi- 
primers can be added in order to generate a specific DNA polymer 
containing the cene of interest. This As mentioned in point D, 
this DNA polymier can be purified and cloned into a vector of in- 

2 5 terest . 

e::^J'1?le:5 



30 e:<a]-i?le: i 
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Method 1 

The strong advantage, of the method exemplified here is the 
robustness and lack of PGR contamination problems, due to the 
constant presence of parental template. Furthermore this method 
is less labor demanding than methods described in the prior art, 
thereby providing excellent possibilities for automation. 

Nine different plasmids containing DNA sequences encoding 
9 different variants of the H. lanuginosa lipase gene, were 
mixed in equimolar amounts. The variant genes contained from two 
to seven mutations scattered throughout the gene. 

The DNA sequence of the H. lanuginosa lipase gene and the 
amino acid sequence of the lipase are disclosed in EP 0 305 216 

The variants are indicated according to standard terminol- 
ogy as described in e.g. EP 0 3S6 609, and WO 97/07202. 

The following 9 variant genes were shuffled: 



N94K-^D96L + E99K 

SP??.R? + N94K^D9oL-^T2 31RTN2 33?,rD2 34?.^Q2 4 9R 
S ? ? R R ? 1 9 C + C 3 6 .-. - :■ i ? 4 :■' D 5 5 L ^ Q 2 4 9 R 

ST?RR?-^N94R 

SC I RR-N 9 4 Kr D9 6L-f E2 3 9C-Q2 -. 9 R 

D137GTDl57G-fE210V 

D95LrE99K-^Vie7;-. 

S??RR? + D57G-rM34K-i-09 6L-^Q2 4 9R 

;-;94R-^-F95L 



following components where -ix.ed in a microtube: 

2 ^1 pias~iG n-.i:-.ture (C.i5 uc/^1), specific primers flank- 
___■,/..•., ,.- '. T.N^ riMT?. 2.5 rr-M MqCl,, 2 li1 



10 



lb 
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10* taq buffer (Perkin Elmer), 0 . 5 ^1 taq enzyme in a total vol- 
ume of 20 (il. 

The tube was set in a Perkin Elmer 2400 thermocycler . The 
following FCR-program was run:(94°C, 5 minutes) 1 cycle: 
ore , 30 seconds, 70"C, 0 seconds) 99 cycles {72''C, 2 minutes, 

i'Z indefinite) 1 cycle 

The PCR-reaction was run on a 1.5 % agarose gel. A DN.^^.- 

band of the specific expected size was cut out of the agarose 
gel and purified using JETsorb (from GENOMED Inc.). The purified 
PCR-product was cloned into a T.^.-vector {from Invitrogen (the 
original T.^ cloning kit). The iigared product was transformed 
into a standard Escherichia coli scrain (DK5a) . 

20 transfornants where fully sequenced across the gene of 

interest . 



Result : 

The following 20 variants v;ere found: 

1 . D13 7GrDl 670^12 lOVr: 21 5C 

2 . s???.R?-^D57G':'i9-'iKrD95L.-HQ2'; 9R 

20 J. :;r^'.R-^rs5L 

4 . S ? ? R R ? 0 1 3 7 G D 1 c 7 3 r EI 2 1 0 * . ' 

z > ; 9 0 9 6 L El 9 9 '■' 1 5 7 A T 2 o 7 I 

z. D137GrD157GTl210V 

7 . ^:9 4K+D9dL-^E99K^V137A 

2 5 c, D57G-fN94RH-r9 5LrQ2 4 9R 

9. :;94rC-fD95L-E:99K-H:210V 

10 . S??RR?-rA19G-i-G3 5.-.-rK9 4K-rD9cl 

11. i;94R-^F95L 

12 . D137GtD1o7G-hE:210V 
30 13/ :;94K^D95L-Q249R 
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14 . SPPRRP+Q15P+A19C+C35A+N94K+D96L 

15. SPPRRP+N94K+D96L+T231R+N233R+D23 4R+Q24 9R 

16. D137G+D167G+E210V 

17 . SCIRR+N94K+D96L+Q249R 
5 18. N94K+D96L+E99K 

19. N94R+r95L 

20. SPPRRP+N94R+F95L+F113S+Q249R 

Nearly all mutations where represented (19 of 20) indicat- 

10 ing little bias for specific templates. 



Statistics: 


Not shuffled 








10 


Shuffled between at 


least 


2 


templates 


8 


Shuffled between at 


least 


3 


templates 


2 



The shuffled sequences can then be subcloned from the E,coli Tk 
vector into the yeast vector pJS026 as a BanHI-Xbal fragment 
15 (see WO 97/07205), and e.g. screened for new shuffled sequences 
with improved performance in detergents (see WO 97/07205). 



EKAi'iPLE 2 
Method 2 : 

?CR products of 10 different lipase variant genes were 
generated as above and pooled in equimoiar amounts. 



The following 10 m;utant genes were shuffled. 

1' . D137G + D167G + E:210V 

25 2 ' . D96L+E99KtV187A 

3' . .N94K+D96L-hE:99K 
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4 ' . SPPRRP+D57G+N94K-fD96L-fQ249R 
5*. DlllN+r211A+G225P 

6' . SPPRRP+N94K+D95LtT231R+N233R-^D234R+Q24 9R 
7 ' . SPPRRP+A19C+C35A+N94K+D96L+Q249R 
8 • . STPRRP + N94R 
9' . N94R+F95L 

10'. SCIRR+N94K4-D96L+E239C+Q249R 

The following mixture was generated in a suitable tube: 

1 ^l PGR mixture (0.1 ^g) , decamer random priner (30: 
pmol), 2 ^il 10* Klenow buffer (Promega), 0.25 nuM dNT?, 2.5 r-X 

MgCl, in a total volume of 20 ul. 

The mixture was set in a PE2400 thermocycler where the 
following program was run: 95=C, 5 minutes, 25°C 5 minutes, 0.: 
ml Klenow enzyme was added, 25^C 60 minutes, 35°C 90 minutes . 

This procedure generateu 2 high nu.T^Der of small DNA poly- 
mers originating from all parts of the gene 

10 lal was taken out for test on agarose gel, 
10 |.il ?CR mixture (0.25 cNT?, 1 )il 10* Taq buffer 

(Perkin Elmer), 2.5 r:-: XrCl., 0.5 jal Taq enzyr.e) was added to 
the 10 !ul in the tube rr. the thermocycler. Then the following 
standard PCR-procram was run: (94"C, 5 minutes) 1 cycle, (94^C 33 
seconds, 45°C,. 30 sezov.cs, 72"C 30 seconds) 25 cycles, 72°C 7 

m.inutes, 4°C indefinite. 

The PGR products were run on a 1.5% agarose gel. A clear 
unbiased smear was seen, DU?-. betv;een 400 and 800 bp was isolatec 
f rom the gel . 

Half of the purified ?Z': product was mixed in a tube wit 
two 'specific primers (40 c:-:.!; flanking the gene of interest 
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0.25 mM dNTP, 2 )il 10^ Taq buffer, 2,5 mM MgClj. Then the fol- 
lowing standard PCR-program was run: (94°C , 5 minutes) 1 cycle, 
(94°C 30 seconds, 50°C, 30 seconds, 72^C 30 seconds) 25 cycles, 
72°C 7 minutes, 4°C indefinite. 
5 The PGR product was run on a 1.5% agarose gel. A specific 

though weak band of the expected size was isolated. Additional 
PGR was run using specific primers (as mentioned above) in order 
to amplify the PCR-product before cloning. 

The PGR-product and the desired vector were cut with the 
10 appropriate restriction enzymes (Bamiil /Xhol ) . The vector and the 
PGR product were run on a 1.5^ agarose gel, and purified fror, 
the gel . 

The cut PGR-product and the cut vector were mixed in a li- 
gase buffer with T4 DNA ligase (?ror:ega). After overnight liga- 
15 tion at 16°G the mixture was transformed into E. coll strain 
DH5a . 

IS clones v-ere fully sec'^er:ced across the gene. 



Kesuit ; 



f o 1 1 o V/ i n g 19 



I ounc : 



1 


ST?RR?tM94Rt;-;233 


R-rD234R-Q249R 


2 . 


5??RR?-i-N233R*D23 


4R-Q249R 


3 . 


D9cLrD167G-^n24 9R 






N94R 




5 . 


Dlo7G 




6, 


S P ? RR ? A 1 9 G A 2 c T 


-^N9 4K-rD9 5L-r:ilN-^E2 3 9G 


7 . 


S??RR?rA19GrC35A 


T>i94:C^D96L-:^24 9R 


S . 


N94KrD3oL 




9. 


t-;25T-f D57G-fN94R-^E: 


9 9K^Di67G'fT2 31R-^N2 3 3R-fD2 3 4R + Q2 4 9R 
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10. N94R+Q249R 

11. D167G 

12. D167G 

13. T32I+N94R+F95L+D167G+Q249R 

14. E87K+N94K+D96L 

15. N94R+F95L+Q249R 
lb. N94K+D95L+D111N 

17. STPRRP+S17T 

18. N94K+D96L+V187A 

19 . SPPRRP+D57G+N94K+D9 5L+D111N+L151S 

Ml template variants were represented indicating littl 
bias for specif ic 'templates . 



There were no apparen 


t hot spots with 


regard 


to 


mu ta t ion 


exchange and it seems to be 


evenly distributed along 


the 


gene 


Statistics : 


Mot shuffled 










Shuffled between at least 


2 temtolates 


10 






Shuffled between at least 


3 templates 


6 






Shuffled betv^een at least 


4 templates 


1 






Shuffled between at least 


5 uemC'lates 


0 






Shuffled between at least 


6 tem.plates 








The shuffled s e c u e n c 


es can then be 


subclon 


ed 


from the 


£ , col i T A vector into t n e 


veast vector pJS025 as 


a EamHI-Xbal 


fragment (see WO 97/07205). 


and e.g. screen 


ed for 


new 


shuffled 



sequences with inproved perfor-.ance in detergents (see v;0 
97/07205) . 
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Example 3: 

Air.ylase variant shuffling: 

In Example 1, it was shown how a number of multiple vari- 
ants of H. lanuginosa lipase were shuffled. In a similar manner 
variants of Bacillus a-amylases can be shuffled. 

Earlier patent applications have identified variants of 
various a-amylases froT. Bacillus species improved for particula 
properties, e.g. thermostability, stability under Calcium- 
depleted conditions, improved wash-performance etc. (see 
WO95/10603, W096/2387^ W096/23873, and PCT/DK97 /OO 1 97 ) . 

Variants of S. licheniforriis a-amylase amyL can be shuf- 
fled as follows. The variants are all located in the S. subtilll 
expression vector pDN1528 described in WO95/10503. 

The experiment is carried out under the exact same condi- 
tions as Example 1 except that the flanking 27mer primers used 
to initiate DNA synthesis were different. 

The ?CR ar.piifiec banc of approximately 1500 bp is puri- 
fied from an agarose gel and cloned as described in Examplel. 
Alternatively restriction sites located within the amyL gene ca 
be utilized to clone the library of shuffled genes into either 
3:^c-*Uus olasmid oDN1 523 or an E. coli vector containing the 
wild tvoe amvL gene, e.g. pJeENl described in W096/23874 
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Shuffling of two genes encoding homologous a-amylases: 
amyL and the amylase identified by SEQ.ID no2 (amino acid) and 
SEQ.ID no. 5 (DMA) described in W096/23873. 

5 The forward strand (identical to the mRNA) of amyL can be ampli- 
fied in a PGR using standard concitions. 

The forward strand is separated from the reverse strand 
based on its affinity to streptavidin coated magnetic beads and 
denaturation of the two strands with NaOH. 
10 Similarly the reverse strand (complementary to mRNA) of the amy- 
lase encoded by SEQ.ID no5 (W096/23873) can be amplified and 
isolated 

Two primer strands are used as templates in a PGR: (94 °G 5 
minutes) ^ 99 cycles of (94°C, 30 seconds; 60°C, 0 seconds) +• 
15 (72°G, 5 minutes) using random primers of various lengths, Taq 
polymerase and standard buffer conditions as described in Exam- 
pie 1. 

The resulting approxi-ately 1500 bp product is cloned ei- 
ther as TA cloning as described in Example one (for verification 
20 of the sequence of the resulting clone) or into Bacillus vector, 
pTVBllO utilizing Sfil and ?stl restriction sites. 

The original terr.piaue can be removed from, the PGR at any 
step (e.g. after 5, 10 or 20 cycles) based on the biotin tag). 
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PATENT CLAIMS 

1. a method for the construction of a library of recombined 
homologous polynucleotides from a number of different starting 
single or double stranded parental DNA templates and primers by 
induced template shifts during an in vitro polynucleotide syn- 
thesis using a polymerase, whereby 

A. extended primers or polynucleotides are synthesized by 

a) denaturing a parental double stranded DNA template to 
produce single stranded templates, 

b) annealing said primers to the single stranded DMA 
templates, 

c) extending said primers by initiating synthesis by use 
of said polyrr.erase, 

d) cause arrest of the synthesis, and 

e) denaturing the double strand to separate the extended 
primers from the te.T.plates, 

3. a template shift is induced by 

a) isolating the nevly synthesized single stranded ex- 
tended primers frc~ the ter?.plates and repeating steps 
A.b) to A.e) usir.r said extended primers produced in 
(A) as both prirr.ers and templates, or 

b) repeating steps A.b) to A.e), 

C. the above process is ler-inated after an appropriate num- 
ber of cycles of process steps A. and B.a), A. and 3.b), 
or coTJDinations thereof, and 

D. optionally the produced polynucleotides are amplified in a 
standard PGR reaction witr. specific primers to selectively 
amplify homologous polynucleotides of interest. 

2 The method of claim 1, v/herem said polymerase is a ther- 

mostable DNA polymerase, such as Taq, Amplitaq, Vent, Pwo. 
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3. The method of claim 1, wherein said polymerase is a DNA 
polymerase, such as T4 polymerase, T7 polymerase , E. coli DNA 
polymerase I or the Klenow fragment of DNA polymerase I, and 
said polymerase is added to the reaction mixture after each cy- 
cle . 

4. The method of claim 1, wherein said polymerase is a RNA 
polymerase . 

5. The method of any of the claims 1 to 4 , wherein said po- 
lymerase is an error-prone polynp.erase , such as a reverse tran- 
scriptases like HIV reverse transcriptase. 

6. The method of any of the claims 1 to 5, wherein said 
starting single or double stranded parental templates are dif- 
ferent in that they contain different point mutations in the 
same native polynucieot ice (cer.e), or are homologous polynu- 
cleotides (genes) isolated fro- nature. 

7. The method of clai- 5, wherein said starting single or 
double stranded parental te-piates exhibit an identity at the 
DN.-. level of e.g. more than 95'., SO^, 85%, 80%, 75%, 70%, 65%, 
60%, 55%, or even rr.ore than 50% identity. 

8. The method of any of the claims 1 to 7, wherein the ar- 
rest of the polymerase reaction in step .-..d) is achieved by 
raisina the temperature. 



30 



9. .The method of claim S, v;r.erein the temperature is raised 
to a value between SO°C and S9=C, preferably 94°C. 
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10. The method of any of the claims 1 to 1, wherein the ar- 
rest of the polymerase reaction in step l.d is achieved by add- 
ing specific reagents, such as DMSO. 

5 

11. The method of any of the claims 1 to 10, wherein said 
primers comprise a population of completely random primers. 

12. The method of any of the claims 1 to 10, wherein said 
10 primers comprise a population of semi-random primers. 

13. The method of any of the . claims 1 to 10, wherein said 
primers comprise a population of specific primers. 

15 14. The method of any of the claims 1 to 13, wherein said 
orimers are labeled, preferably with biotin or digoxigenin . 

15. The method of any of the claims 11, 12 or 14, wherein 
said random or seni-rancorr. primers have a length of from 6 to 
20 500 bp, preferably from 15 to 300 bp, and better from 25 to 150 



DO . 



15. The method of ar^y of the claims 13 or 14, wherein said 
primers are 6 to 75 bp long, 

25 

17. The method of any of the claims 1 to 16, wherein said 
starting parental DN.^. templates are at least 50%, 60%, 70%, 
80%, 90%, or 95% hom.ologous. 

30 18. ,The method of claim 17, v;herein said starting parental 
DN.^. Templates originate from v;ild type organisms of different 
genera 



W098/41653 36 PCT/DK98/001O4 



19. The method of any of the claims 17 or 18, wherein said 
starting parental templates have a length of from about 50 bp 
to 20 kb, preferably about 100 bp to 10 kb, more preferred 
about 200 bp to 7 kb, especially about 400 bp to 2 kb 

20. The method of any of the claims 1 to 19, wherein said 
starting parental templates are linear DMA fragments generated 
by a PGR reaction. 

21. The m.ethod of any of the claims 1 to 20, wherein said 
starting parental templates are , cloned into suitable vectors, 
such as a plasmid. 

22. The method of any of the ciair^s 1 to 21, wherein said 

starting parental terr.plates represent a polynucleotide encoding 

an enzyne, such as a carbonyl hydrolase, carbohydrase , or es- 

.,,^v, p n-o^eas^ ^^oase, a-vlase, cellulase, oxi- 

terase, such cs a p^o^fcici:^-, , 

dase, oxido reductase, etc. 

^ r--- cla*"-s 1 to 21, wherein said 

23. The "etnod or ciny c- u:.e cxc-...^^ 

^.,1 .^.^^ip-oc -oo-os^nt a Dolvnucleotide encoding 
startinc oarentai Le:..p^ci._s 

a oolyoeotide having biological activity, such as insulin, 

ACTH, glucagon, sor^atostat soraa totropm, thyi^osm, parathy- 

K r,^-v^-^' ^o-r-^-'^s, sc"ato:?.edin, erythropoietin, 

roid horiTione, pj-^ua^v ..c^i..^..-o , 

^u^^-^^;^ ^.-np.-^-^^ -oo^ n, hyoothalamic re- 
luteinizing horrr.one, cnorion^c ^'^na-0--op*iw 

• -^..v-^-^- -^^^^^.os thyroid stimulating 
leasing factors, anticiure^ic .:^...o..-s , uuy. 

■ ^ ^^-o>-'-- - --o-jDOOO^^ tin (TPO) and prolac- 
hormone, reiaxm, m^^r^r .e.o.:, o.-J^-po^^ lx 

tin . 

. . ^'^^ ^ipTT-.c "I to 21, wherein said 

24. The rr.ethoG of any o. u..^ ^ic:x...s . lu ^x, 

^ T a^^^c: -^.n-^s^nt a Dolynucleot ide contain- 

startmg parental tempia-^es ^ep^es-.n- c u^xy 
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ing a biological function such as transcription initiation 
(promoter) or termination, translational initiation, operator 
sites related to expression of genes 

5 25. The method of any of the claims 1 to 24, wherein said 
number of cycles are less than 500, 200, or less than 100 cy- 
cles . 

26. A method of identifying a polypeptide of interest exhib- 
10 iting improved properties in comparison to naturally occurring 

or other known polypeptides of the same activity, whereby a li- 
brary of recombined homologous .polynucleotides produced by a 
process according to any of the claims 1 to 25 are cloned into 
an appropriate vector, said vector is transformed into a suit- 
15 able host system, to be expressed into the corresponding 
polypeptides and displayed, and said polypeptide is screened in 
a suitable assay, and positive polypeptides selected. 

27. A method for producinc a polypeptide of interest as iden- 
20 tified according to ciairr: 26, whereby a vector comprising a 

polynucleotide encoding said identified polypeptide is trans- 
formed into a suitable host, said host is grown to express said 
polypeptide, and the pol^/peptide recovered and purified. 
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